Structure-Aware Procedural Text Generation From an Image Sequence
Abstract
Creating new value by combining materials is an important activity for our society. From daily cooking to the manufacturing industry, we often describe the way to do it as a procedural text. As pointed out by some previous studies in natural language understanding, one property of procedural text is its dependency on context, which describes the merging operations of materials and can be represented as a graph or tree structure. This paper aims to investigate the impact of explicitly introducing such a structure on the vision-and-language task of procedural text generation from an image sequence. To this end, we propose (1) a new dataset, which extends the definition of such a structure to a vision-and-language version, and (2) a novel structure-aware procedural text generation model, which learns the context efficiently. Experimental results show that the proposed method can boost the performance of traditional versatile methods.
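The idea that merging operations on materials form a tree can be illustrated with a minimal sketch. This is not the paper's dataset format or model; the `Node` class, the `materials` helper, and the pancake recipe are all hypothetical and exist only to show how a later step depends on what earlier steps merged.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A raw material (leaf) or a merging action (internal node). Hypothetical type."""
    label: str
    children: List["Node"] = field(default_factory=list)


def materials(node: Node) -> List[str]:
    """Collect, left to right, the raw materials that a step depends on."""
    if not node.children:
        return [node.label]
    leaves: List[str] = []
    for child in node.children:
        leaves.extend(materials(child))
    return leaves


# Hypothetical two-step procedure: mix a batter, then fry it with butter.
batter = Node("mix", [Node("flour"), Node("egg"), Node("milk")])
pancake = Node("fry", [batter, Node("butter")])

print(materials(pancake))
```

In this sketch the "fry" step does not mention flour, egg, or milk directly, yet it depends on them through the "mix" node, which is the kind of context dependency the abstract refers to.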
Similar resources
Procedural Text Generation from an Execution Video
In recent years, there has been a surge of interest in automatically describing images or videos in a natural language. These descriptions are useful for image/video search, etc. In this paper, we focus on procedure execution videos, in which a human makes or repairs something and propose a method for generating procedural texts from them. Since available video/text pairs are limited in size, t...
Table-to-text Generation by Structure-aware Seq2seq Learning
Table-to-text generation aims to generate a description for a factual table which can be viewed as a set of field-value records. To encode both the content and the structure of a table, we propose a novel structure-aware seq2seq architecture which consists of field-gating encoder and description generator with dual attention. In the encoding phase, we update the cell memory of the LSTM unit by ...
Text Structure-Aware Classification
Bag-of-words representations are used in many NLP applications, such as text classification and sentiment analysis. These representations ignore relations across different sentences in a text and disregard the underlying structure of documents. In this work, we present a method for text classification that takes into account document structure and only considers segments that contain informatio...
Groundtruth Image Generation from Electronic Text (Demonstration)
The problem of generating synthetic data for the training and evaluating of document analysis systems has been widely addressed in recent years. With the increased interest in processing multilingual sources, there is a tremendous need to be able to rapidly generate data in new languages and scripts, without the need to develop specialized systems. We have developed an approach that uses langua...
Automatic FDP/FAP generation from an image sequence
This paper presents an automatic FDP (Facial Definition Parameters) and FAP (Facial Animation Parameters) generation method from an image sequence that captures a frontal face. The proposed method is based on facial feature tracking without markers on a face. We present an efficient method to extract 2D facial features and to generate the FDP by applying 2D features to a generic face model. We ...
Journal
Journal title: IEEE Access
Year: 2021
ISSN: 2169-3536
DOI: https://doi.org/10.1109/access.2020.3043452